Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

نویسندگان

چکیده

Reinforcement learning (RL) is typically framed as a machine paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL fundamentally human the loop (HitL). The reward functions (and other components) of Markov decision process are defined by humans. decisions tackle certain problem, and deploy learned solution, taken Humans can also play critical role providing information agent throughout its life cycle better succeed at problem question. We end highlighting set HitL research questions, which, if ignored, could cause fail live up full potential.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Agent-Agnostic Human-in-the-Loop Reinforcement Learning

Providing Reinforcement Learning agents with expert advice can dramatically improve various aspects of learning. To this end, prior work has developed teaching protocols that enable agents to learn efficiently in complex environments. In many of these methods, the teacher’s guidance is tailored to agents with a particular representation or underlying learning scheme, offering effective but high...

متن کامل

Ultrasonic and Cooling Approaches for Reinforcement of the Microextraction Methods

Since solid-phase and liquid-phase microextraction (SPME and LPME) were introduced, as effective solvent-free methods, many efforts have been made to improve their modes and applications. However, due to limitations with sensitivity and efficiency, researchers have focused on improving the performance of their basic primary modes. In this way, in recent years, different methods such as ultrason...

متن کامل

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

Closed-loop control relies on sensory feedback that is usually assumed to be free . But if sensing incurs a cost, it may be costeffective to take sequences of actions in open-loop mode. We describe a reinforcement learning algorithm that learns to combine open-loop and closed-loop control when sensing incurs a cost. Although we assume reliable sensors, use of open-loop control means that action...

متن کامل

willingness to communicate in the iranian context: language learning orientation and social support

why some learners are willing to communicate in english, concurrently others are not, has been an intensive investigation in l2 education. willingness to communicate (wtc) proposed as initiating to communicate while given a choice has recently played a crucial role in l2 learning. it was hypothesized that wtc would be associated with language learning orientations (llos) as well as social suppo...

learners’ attitudes toward the effectiveness of mobile-assisted language learning (mall) in vocabulary acquisition in the iranian efl context: the case of word lists, audiobooks and dictionary use

رشد انفجاری تکنولوژی فرصت های آموزشی مهیج و جدیدی را پیش روی فراگیران و آموزش دهندگان گذاشته است. امروزه معلمان برای اینکه در امر آموزش زبان بروز باشند باید روش هایی را اتخاذ نمایند که درآن ها از تکنولوژی جهت کمک در یادگیری زبان دوم و چندم استفاده شده باشد. با در نظر گرفتن تحولاتی که رشته ی آموزش زبان در حال رخ دادن است هم اکنون زمان مناسبی برای ارزشیابی نگرش های موجود نسبت به تکنولوژی های جدید...

15 صفحه اول

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Frontiers in artificial intelligence and applications

سال: 2023

ISSN: ['1879-8314', '0922-6389']

DOI: https://doi.org/10.3233/faia230098